Selective all-pole modeling of degraded speech using M-band decomposition
نویسنده
چکیده
This paper describes a speech enhancement system which exploits both timeand frequency-localized behavior. The local characteristics are obtained from stationary regions selected by M-band decomposition with an adaptive analysis window. The spectrum of each selected region is estimated with an all-pole model. In order to model only the spectral region of interest, Selective Linear Prediction (SLP) is used. By modeling the local spectrum, either independently or dependently, with respect to other enhanced spectral regions, and adjusting the model order to the local characteristics, a balanced-tradeoff between noise reduction and speech distortion can be achieved.
منابع مشابه
All-pole Modeling of Wide-band Speech U Polynomial
The autocorrelation function of the all-pole filter given by the conventional linear prediction (LP) matches exactly the autocorrelation function of the input signal between indices 0 and m, when the prediction order equals m. This study describes a recently developed technique, Weighted-Sum Line Spectrum Pair (WLSP), where an all-pole filter is defined by using a sum of weighted LSP (Line Spec...
متن کاملA hybrid sub-band sinusoidal coding scheme
This paper describes a hybrid sub-band speech coding scheme based on sinusoidal coding and CELP. Purely voiced speech is encoded using sinusoidal coding techniques and phase information is selectively transmitted. For mixed and unvoiced speech, the lower band is processed by sinusoidal coding algorithms while the upper band is encoded using CELP. To accommodate the extra bandwidth required by t...
متن کاملSpectral modification for concatenative speech synthesis
Concatenative synthesis can produce high-quality speech but is limited to the allophonic variations and voice types that were captured in the database. It would be desirable to modify speech units to remove formant discontinuities and to create new speaking styles, such as hypoor hyper-articulated speech. Unfortunately, manipulating the spectral structure often leads to degraded speech quality....
متن کاملOn the effects of short-term spectrum smoothing in channel normalization
We present a simple analysis showing that channel normalization techniques are less eeective when applied to spectral energies obtained by (weighted) summation of components of the short-time Fourier power spectrum of speech. We show that applying channel normalization processing prior to critical band integration or linear predictive all-pole modeling improves the eeectiveness of the techniques.
متن کاملHarmonic weighting for all-pole modeling of the voiced speech
A new distance measure for all-pole modeling of voiced speech is introduced in this paper. It can easily be integrated within the concept of discrete Weighted Mean Square Error (WMSE) all-pole modeling, by a suitable choice of the modeling weights. The proposed weighting will address the problems such as: harmonic estimation reliability, perceptual significance of the harmonic and the model mis...
متن کامل